INTEGERS WITH A PREDETERMINED PRIME FACTORIZATION 

ERIC NASLUND 

Abstract. A classic question in analytic number theory is to find asymptotics for ak{x) 
and TTf^{x), the number of integers n < x with exactly k prime factors, where TTk{x) has 
the added constraint that all the factors are distinct. This problem was originally resolved 

^sl ' by Landau in 1900, and much work was subsequently done where k is allowed to vary. In 

this paper we look at a similar question about integers with a specific prime factorization. 
Given a. G N^, a = (ai,a2, • ■ • , Q^fc) let aa{x) denote the number of integers of the form 
n = p"^ ' ' ' Pk" where the pi are not necessarily distinct, and let 7r„(a;) denote the same 
G ' counting function with the added condition that the factors are distinct. Our main result is 

I '~ , asymptotics for both of these functions. 
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^ ■ 1. Introduction 

^ \ One of the major problems in the 19th century was to find the growth rate of the number 

d ' of primes less then x, that is the function 
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^ \ In 1797, Legendre conjectured that 7r(x) is asymptotic to j^, written as 7r(x) ~ j^, which 

fr^ I means that we have the limit 



m . lim -^, = 1. 

Q , x^co xj log X 



Although a more precise conjecture was given by Gauss, little progress was made over the next 
50 years. In 1848 and 1850, Chebyshev made several contributions, and managed to prove 
weaker upper and lower bounds. A major breakthrough occurred in 1859, when Riemann 
published his seminal paper, "On the Number of Primes Less Than a Given Magnitude," 
C^_' in which he outlined a proof of Legendre's conjecture using complex analysis and the zeta 

function. In 1896, 99 years after Legendre made his conjecture, Hadamard and de la Vallee 
Poussin rigorously completed Riemann's outline, proving what is known today as the prime 
number theorem [31 • In particular, we can write down the explicit error term : 

(1.1) 7r(x) = -^ + 0' ^ 



log X \ log X 

but to be more precise than this we would need to introduce the function from Gauss's 
conjecture. 

A natural follow up question is whether or not we have similar asymptotics for the number 
of integers with exactly k prime factors. There are two reasonable ways to define the counting 
function; let crfc(a;) denote the number of integers less then x with exactly k prime factors, 
and let 7rfc(a;) be the same but with the added constraint that the k prime factors must 
be distinct. For convenience, we also define the sets V^ = \n: n = pi- ■ -pk] and V^ = 



{n : n = pi- ■ -pk where i ^ j ^ pi ^ pj} , so that we may write 

o-kix) = X^ 1 and nk{x) = ^^ 1. 

n < X n < X 

neV^ neVl 

In 1900 by Landau ^ found the growth rate of these functions, and he proved that for fixed 
k we have 

(1.2) ..(x)~..(x) -('°S'°S-)'" 



(A; — 1)! logx 

E. M. Wright then gave a short elementary proof of this in 1954 |3]. Heuristically we might 
expect this kind of asymptotic since X^fcLi c"fc(a^) = \_x\ , and if we could ignore the error term 
and sum over all k < logx, we would arrive back at this equality again as 

E Y^ x(loglogx) _ X ^ (log log X) _ 

^fcW~2^ (A;-l)!logx logx^ k\ ^' 

fc=l fe=l V ^ to to k=0 

Note that even though this works out, the heuristic is not entirely reliable. It seems to suggest 
that ak{x) ~ '^(■fcf. °)fio ^ even when k varies with x, which is not true when k ~ log log x \i\. 
In his paper. Landau also gave explicit error terms, and showed that for k >2 

(1.3) a,{x) ^^aoglog^)'"^o^aoglog^)'" 



{k — 1)! logx y logx 

and 

x (log log x)''"^ / a; (log log x)''" 

(1-4) 7rfc(x) = — -— hO' 



(A; — 1)! logx I logx 

where the notation 0{f{x)) means that the error term is bounded in absolute value by 
some constant multiple of f{x). (Although seperated on different lines, note that the above 
asymptotics are indeed the same.) In this paper we are interested in something very sim- 
ilar, which is counting the number of integers of a particular shape, integers of the form 
Pi^ ■ • ■ J9,"" where the Oj are fixed exponents. For example, we may ask how many integers 
of the form pq^ are there less than x. To discuss this problem, we begin by introducing 
some notation. Given a vector ex. = (ai, ■ ■ ■ , a^) ^ ^'^, define a^ix) to be the num- 
ber of integers n < x oi the form n = p°^ ■ ■ -p^*, allowing prime repetitions, and TTct{x) 
to be the number without prime repetitions. If we set V^ = {n : n = p^^ ■ ■ ■p"''}, and 
V^ = {n : n = p^^ ■ ■ ■ p"'' where i ^ j ^ Pi ^ Pj}, then as was done for TTk{x), and crfc(x), we 
can rewrite these counting functions as 

o-a(x) = X] 1 and 7rc»(x) = ^^ 1. 

n < X n < X 

neV^ neVl 

Our goal is to provide asymptotics for o"q(x) and 7rQ,(x), and our main theorem is: 

Theorem 1. Let r,a he positive integers. Suppose we have a vector of the form a = 
(a, ■ ■ ■ ,Q;,ai,--- , a^) G M*^"'"'', where k > is the multiplicity of a, and where a < ai 
for all i. Then if (3 = (ai, ■ ■ ■ , a^) G N*", we have 

o-a (x) ~ at ix~) ^ n'^ 



and 
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The above theorem tells us that the higher powers introduce a constant factor into the 
asymptotic since both of the series 'YlmeV''^~~ ^"^^ Sne'P'"'^"" converge absolutely. The 
convergence of these series follows from the fact that — > 1 along with equation 12.11 in the 
next section. In particular, returning to our previous example of counting the number of 
integers of the form pq^ less than x, we have that PS = V% = {p"^ : p is prime}, and hence 

7r(i,3)(x) ~ cr(i,3)(x) ~ y] — = 1 ^(3) 

log X ^-^ p^ log X 

where P(s) = ^„p~* is the prime zeta function. We can ask whether the constant can always 
be rewritten as a product of prime zeta functions, and this is answered by the following 
theorem: 

Theorem 2. Suppose we are given a < ai < ■ ■ ■ < ar , and that for any choice of ti G 
{—1, 0, 1}, we have ^^ eiOj = implies e^ = for every i. Then 



E"--n^(^ 



n&V-p 



where P{s) = ^ p * is the prime zeta function. This is equivalent to the condition that 
every n G VZ, where (3 = (ai, . . . , ar), has a unique representation as n = Pi^ ■ ■ ■p"''. 

For example, the above two theorems imply that the number of integers of the form n = 
PiP2P3P4:PI^ , with n < X, will be asymptotic to 

a2 (x) P(3)P(5)P(19) ~ ^loglog^ p(3)p(5)p(i9)_ 

logx 

2. The Main Result 

It is very important to split up the smallest power, as this is contributes the most to the 
sum. Throughout this section, we write our vector of exponents as ck = (a, ■ ■ ■ , a, ai, ■ ■ ■ , a^) ^ 
N*^"'"'', with 1 < a < «! < ■ ■ ■ < ar, where fc > is the multiplicity of a, and let 
f3 = (ai, ■ ■ ■ ,ar) G N''. To start, we provide a simple upper bound for o"/3(x). Notice 
that 

Tifiix) < apix) = Yl 1 < XI ^' 
m < X Pi--Pr'^<x 

where the right hand sum ranges over all vectors of primes of length r satisfying p"^ ■ ■ ■ p^'' < 
X. Since ai < a^ for all i, and p"^ ■ ■ •p"'' < x implies that p" ^2^ ' ' 'Pr^ ^ 2;, we see that 
replacing every exponent by ai only increases the sum. Then using [TT^ we have 

(2.1) x,W<M^)< E i = ofx-*fi25M^"' 

^-^ ^ \ logx 

pi---pr<X°'l 

The following subsection is devoted to examining o"a(x). The key will be using the hyperbola 
method, and most of the lemmas will apply identically to the proof for 7ia{x). 



2.1. aa{x). Each integer n G V^ has one part in V^, and one part in VZ, and our goal will 
be to split it up between these two to better understand aa{x). With this in mind, we might 
expect 

(Tcix) ^ ^ 1. 

17171°" < X 

However, this will not be an exact equality as an integer k < x with k G V^ may have more 
than one representation of the form k = miff with n G P^, m G VZ. Since k G V^ can 
have at most one representation of the form k = mn" with n G VJI, m G VZ, we have the 
inequalities 
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1 < <'c{x) < E ^' 



mn" < X mrf < x 

neV^,meV^ neV^,meV^ 

Rewriting so that we first sum over m, this is 
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mn°' < X m < X 
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(2.2) 



Our first goal will be to remove all of the terms from the sum with m > „ ^ ,c for some 

— (log x) 

constant C > 2, without introducing large error. For example, we could take C = 3 to prove 
the asymptotic. However to achieve the optimal error term we need something of the form 
C = 2aai + 1, a choice which will become clear later on. Note that we need only bound this 
sum for (Tk{x), since Tiki^x) < ak{x), and this is covered by the following lemma. 

Lemma 3. For C > 1 we have that 
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Proof. We may change the order of summation and write 
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Using 12.11 this is bounded above by 



n" < --^-T; m < ^ 



E 



O 



n" < 



x"i (log log (x/n")) 
n^ log (x/n") 



r-l 



(logx) 



o 



n&VZ 
( 



X"! (log log X 



,r-l 



E 



n" < 



n'^i 



(log a;) 



Taking the trivial bound, the inner sum becomes 
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SO that we have the upper bound 
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Combining 12.21 along with Lemma |3] and Landau's estimates II. 3[ [L4l for k> 1 yields 
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(2.3) 
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and for A; = 1 by ll.H the prime number theorem, we have 



cr„ X 



(2.4) 
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If we write (log (^ log (^))) = (log log (^) — log a) and then expand using the bino- 
mial theorem, all of the terms will be consumed by the error term except for the one with 
(log log (^)) , which allows us to change the main term in the above to 



(2.5) 
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We may clean up the error terms by bounding each part of the sum from above. Since 
T^ < (logx) , — 7-^ is bounded above by 



1 



loff ( ^-^] log (x) - C log log X log X 
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We also have the trivial bounds 



1 / log log X 

log^x 



log ( -log ( — ) ) < (log (log (x))) 
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since the right hand side is a convergent series. Combining these, for integers A > 0, B > 1 
we have that 
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which gives an upper bound on the error term in both cases, k = 1 and k > 1. The following 
lemma allows us to deal with the main term: 



Lemma 4. For C > 1, we have that 
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Proof. First, note that we have the bounds 



1 1 1 

< ^^ < 
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Using power series expansions we may write 

1 1 



< (log log (a;)) 
,Q /log log X 
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Then [2^ implies that 
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Let C = 2aai + 1 so that (C - 1) [l - ^) = 2a (ai - a) > 2. Upon combining |231 
and lemma H] for fc > 1 we obtain 
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x<i (log logx 
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Tn < (logx) 
Similarly, 12. 4[ 12. 6[ and lemma S] together yield 
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ioY k = 1. To deal with the last sum, write 
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Applying summation by parts, we have that 

JnoRx)^ 
m > (logo;) 
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Then by 12.11 this becomes 



1 /"°° 



(logx)C « ^(logx) 



O (logx) V"i "/ (log log log x)'' ] = O , 2 

V y V(loga;) 

since C ( — — - ) = —2 + (— — - ) . Thus for fc > 1 we have 

\a\ a J yen «/ 

/oQ^ ( \ x^ (log log x)^'"^ ^^^ -^ , n( ^Ooglog^' 

(2.8) a^{x) = a————— ^m ^ + O \x^ 
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m£V^ 



-, > m - + O 



(A;-l)!loga; J^^ \ log a; 

and for A; = 1, 

(2.9) aJx) = a-^^ V m"i + O 

This yields the desired asymptotic 

/o1n^ ^ ^ x^ (log log x)''"^ >r;^ _i 

and since 

/ lA x'^ (log log x) 

"' V'V ^ " (A:-l)!logx 
by Landau's estimates II. 2^ we conclude that 



(2.11) a„ix)r^aJx^) J^ 
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proving the first part of Theorem [H 



2.2. TTaix). To prove the same result for 7ra{x), we start again by splitting integers n G V^ 
into two parts, one in V^, and one in PS. With this in mind we consider 

n°'m < X 

This will be strictly larger then TTa{x) since n and ?7i may have prime factors in common. 
(Note that since all factors are distinct, we cannot have multiple representations k = mn.) 
However, we can throw out all of the terms for which gcd {m, n) > 1 without affecting the 
asymptotic. Write n = qi- ■ ■ q^, and m = p"^ ■ ■ ■p"''. If gcd (m, n) > 1, then we must have 
qi = pj for some i,j. The set of all tuples with q^ = pj is bounded above by 

^«,,,(a;) 



where ckjj = {a, . . . ,a, ai, ■ ■ ■ , {aj + «),■■■ , ar) G N^ ^^"^ and we have k — 1 copies of a. In 
particular, by l2.10[ we see that 



.X ^1 i(loglogx)'' ^ 



cr„, ,(x) = Of a;° 
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for k > 1, and 



for any e > when k = 1. Since there are at most k ■ r possible pairs {i,j), it follows that 
for A; > 1 

f \ Sr T < n f i (log log x)^"^ 

TTc{x) = y^ 1 + X- — - — 

■^^ \ log X 

rfm < X ^ 

and a similar error term as before when k = 1. The main term may be rewritten as 

j: j: ^- j: -((^" 

m < X 71°" < — m < X 

— — m — 

me VI neVl meV^ 

and from here, following through the exact same sequence of steps and lemmas from the 
previous section will yield 



Effx\^\ x<i (loglogx)^ ^ -r^ _i 

""'[U J^" (fc-l)!logx ^"^ "■ 

m < X m&V^ 

All of the upper bounds for o"a(x) still apply to -Rctix), and the only change is that we are 
summing over Vg rather then VZ, which is why the final sum is different. Using [L2| we get 
that 



(2.12) 7r„(x) ~ TTfc (x^) Yl 



m °' . 



meVp 



proving the second part of Theorem [H If the error term is kept throughout the above 
computations, we get the more precise 
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3. The Constant Factor 

Let a > be given, let A = {ai, ■ ■ ■ , a^} where a < «« < aj for all i,j, and set set f3 = 
[ai, . . . , ar). If every n G VZ has one and only one representation of the form n = p"^ ■ ■ ■ p"'', 
then we may decompose the sum as 

neV^ pi P2 Pr 



This equals 



V Pl / \ Pr / 



which by definition of the prime zeta function, P{s) = 'Y^ p **, is 

We now show that each integer can be uniquely represented if and only if ^^ e^Ui = 
with ei G {—1,0,1} implies that every e, = 0. Suppose we are given e^, not all zero, with 
Yi €-iOLi = 0. Then we have then we have aj^ + a^j + ■ ■ ■ + ai^ = aj^ + aj^ + ■ ■ ■ + aji = M 
for some M where each all of the in and jm are distinct. Setting Pi^ = ■ ■ ■ = Pi^ = p, and 
Pji = ■ ■ ■ = Pji = q, we will have a factor of q^p'^ , and this allows us to permute q and p 
giving two representations of the same integer. Conversely, if we have two representations of 
the same integer, then it must be because of a factor of the form g^^p^^, which implies that 
we must have Yli^i'^i ~ f*^^ some non zero choices e^. This then completes the proof of 
Theorem [2l 
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